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3 <110> APPLICANT: Hoshino , Tatsuo 

4 Miyazaki , Taro 

5 Sugisawa, Teruhide 

7 <12 0> TITLE OF INVENTION: Aldehyde Dehydrogenase Gene 

9 <130> FILE REFERENCE: C038435/0185653 
C--> 11 <140> CURRENT APPLICATION NUMBER: US/10/528 , 675 f 
C--> 11 <141> CURRENT FILING DATE: 2005-03-23 f\ \s\ 

11 <160> NUMBER OF SEQ ID NOS : 9 If * 

13 <170> SOFTWARE: Patent In version 3.3 ^ft 

15 <210> SEQ ID NO: 1 

16 <211> LENGTH: 3408 

17 <212> TYPE: DNA 

18 <213> ORGANISM: Gluconobacter oxydans 

20 <400> SEQUENCE: 1 

21 gcgactggca gcagcgcaac tatgaccact atggcctgcc gccctattgg atctaactga 60 
23 tccagtaagc caccatcagc cggcccctgc gggggccggc tttttgcgct agaccccgcc 120 
25 gaggtgctgt cgtaacctaa ggtcacatct ttacttccac atccgccctt gtcagttctg 180 
27 acgtgacaaa ttgtcgcggt catgctgctg aatgcggatg ccagtcccag atccaagccc 240 
29 gacgcaagga gacgtagatg ttacccaaat cattgaaaca taagaatggc gccatgcgcc 3 00 
31 ttgtcgcagc ctcgaccctt gcgctgatga tcggcgcggg tgcccatgcg caggtaaacc 360 
33 cggtcgaagt gccggtgggc gcgaacgaga cctttacctc gcgcgtgctg accaccggcc 42 0 
35 tgtcgaaccc ttgggaaatc acctggggcc ccgacaatat gctgtgggtg accgagcgat 480 
37 cttccggcga agtgacgcgc gtcgacccca ataccggcga gcagcaggtc ctgctgaccc 540 
39 tgaccgattt cagcgtcgat gtgcaacacc agggcctact tggcctcgcg ctgcatcctg 600 
41 agtttatgca agagagcggc aacgactacg tctatatcgt ctacacttat aacaccggca 660 
43 ccgaagaagc gcccgatccg catcaaaagc tggtgcgtta tgcctatgac gctgccgcgc 720 
45 agcagctggt cgatccggtt gatctggtcg caggcattcc cgcaggcaac gaccacaatg 780 
47 gcggtcgcat caaattcgcc cccgatggcc aacacatctt ttacacgctg ggcgagcaag 840 
49 gcgcgaactt tggcggtaac ttccgccgtc cgaaccacgc gcaactgctg ccgacgcaag 900 
51 agcaggtcga cgcgggcgat tgggtcgcct attcgggcaa gatcctgcgc gtgaaccttg 960 
53 acggcacgat ccccgaagac aaccccgaga tcgagggcgt gcgtagccat atctttacct 1020 
55 atggccaccg taacccgcag ggcatcacct ttggccccga cggcaccatt tatgccaccg 1080 
57 aacacggccc cgatacggat gacgagctga acatcatcgc cggcggtggc aactatgggt 1140 
59 ggccgaatgt ggccggctat cgcgatggca aatcctatgt ctacgctgat tggagccaag 12 00 
61 cgcccgctga ccagcgttac accggtcgcg ccggtatccc cgacaccgtg ccgcaattcc 1260 
63 ccgagctgga attcgcgccc gagatggtcg atccgctgac aacctattgg acggtggata 132 0 
65 atgattacga tttcaccgcc aattgcggct ggatctgtaa tccgacgatc gcgccttcgt 13 80 
67 ctgcctatta ctatgcggcg ggcgagagcg gtatcgcggc ttgggataat tcgatcctga 1440 
69 tcccgacgct gaaacatggc ggcatctatg tgcagcacct cagcgatgat ggccaatctg 1500 
71 tcgacggcct gcccgagctg tggttcagca cccagaaccg ctatcgcgat atcgagatca 1560 
73 gccccgataa ccatgttttt gtggcgaccg acaactttgg cacctcggcg cagaaatatg 1620 
75 gcgagaccgg ctttaccaac gtgctgcata accccggcgc gatccttgtc tttagctatg 1680 
77 tcggcgagga tgctgcgggt cagaccggaa tgatgaccgc gcccgcaccg cagacgcaat 1740 
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79 acacgcaagt gcccgccgag ggtgcaggcg cgggcgcgac tgaggttgcg gatgtcgatt 1800 

81 acgacacgct gttcaccgaa ggccagaccc tttatggcag cgcatgtgcc gcgtgccatg 1860 

83 gtgccgctgg ccaaggtgcg cagggcccga cctttgtggg cgtgccggat gtgacgggtg 1920 

85 acaaggacta ccttgcccgc accatcatcc acggttttgg ctatatgccg tcgtttgcga 1980 

87 ctcggctgga tgacgaggag gttgccgcca tcgcgacctt tatccgcaac agctggggca 2040 

89 atgacgaagg catcctgacc ccggccgagg ccgctgccac ccgctgaatg ctgtaaaaac 2100 

91 caccctcgcc tgcacatcag gcgggggtat ttcatttatt ttcacatctg cctttgacat 2160 

93 gtgccgctat cacggttaat gcggcccttc ggctgttctg ggtctaagcg ggtgtgttgc 2220 

95 ccgataagag agacggttca gtccctcccg ccctatttag ggcccattta ggcagaatag 2280 

97 ttttgactca tcaaaatatc gccgcgcctc tggccgcggc cctttcgcaa cgtggatatg 2340 

99 aaacgctgac cgccgtgcag caagctgtgc ttgcgcccga ggctgatggc cgcgacctgc 2400 

101 tggtgtcggc acagaccggt tcgggtaaga cggtggcctt tggtatcgca gtcgcgcccg 2460 

103 accttttggg cgacgacaat atcctgccgc tgaacacgcc gcctgttgcg ctgttcatcg 2520 

105 cccccacgcg cgagcttgcg ctgcaagttg ctcaggaact gacctggctt tacgccaatg 2580 

107 caggtgccca gatcgcgacc tgcgtcggcg gtatggatta ccgcaccgag cgccgcgccc 2640 

109 ttgcacgtct gccgcaaatc gttgtcggca cgcccggccg tctgcgcgac catatcgacc 2700 

111 gtggcggcct tgacctgtcc gaattgcgcg tgaccgtgct ggacgaagcg gatgagatgc 2760 

113 tcgacctcgg cttccgcgat gatctgcaat atatcttgca agccgcgccc gaagatcgcc 2820 

115 gcacgctgat gttctcggcc accgtgccgc gcgagattga aaaactggcc cgcgacttcc 2880 

117 aaaatgacgc cctgcgtctg gaaacccgtg gcgaggccaa gcagcacaac gacatcagct 2 940 

119 accaagcttt gtcggtcacc atgcgcgatc gcgaaaacgc cattttcaac atgctgcgtt 3000 

121 tttatgaatc gcgcacggcg atcatcttct gcaagacccg cgccaatgtg aatgatctgc 3060 

123 tgtcgcggat gagcggtcgt ggcttccgcg tggtggccct gtcgggcgag ctgtcgcaac 3120 

125 aggaacgcac caacgcgctg caagcgctgc gtgatggccg cgccaacgtt tgtatcgcga 3180 

12 7 ccgacgtcgc ggcgcgcggc attgacttgc cgggcctcga gctggtgatc cactacgatc 3240 
129 tgccgaccaa tgccgaaacc ctgctgcacc gctcgggccg taccggccgc cgggtgccaa 3300 
131 gggcgtctcg gcgctgatcg tcacccccgg cgatttcaaa aaagcgcagc gtttgctgag 3360 
133 ctttgccaaa gtgaccgcgg aatggggcaa ggcgccttcg gccgaaga 3408 

136 <210> SEQ ID NO: 2 

137 <211> LENGTH: 609 

138 <212> TYPE: PRT 

13 9 <213> ORGANISM: Gluconobacter oxydans 
141 <400> SEQUENCE: 2 

143 Met Leu Pro Lys Ser Leu Lys His Lys Asn Gly Ala Met Arg Leu Val 

144 15 10 15 

147 Ala Ala Ser Thr Leu Ala Leu Met lie Gly Ala Gly Ala His Ala Gin 

148 20 25 30 

151 Val Asn Pro Val Glu Val Pro Val Gly Ala Asn Glu Thr Phe Thr Ser 

152 35 40 45 

155 Arg Val Leu Thr Thr Gly Leu Ser Asn Pro Trp Glu lie Thr Trp Gly 

156 50 55 60 

159 Pro Asp Asn Met Leu Trp Val Thr Glu Arg Ser Ser Gly Glu Val Thr 

160 65 70 75 80 

163 Arg Val Asp Pro Asn Thr Gly Glu Gin Gin Val Leu Leu Thr Leu Thr 

164 85 90 95 

167 Asp Phe Ser Val Asp Val Gin His Gin Gly Leu Leu Gly Leu Ala Leu 

168 100 105 110 

171 His Pro Glu Phe Met Gin Glu Ser Gly Asn Asp Tyr Val Tyr He Val 

172 115 120 125 
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Trp 


Gly 


Asn 


Asp 


Glu 


Gly He Leu 


Thr 


Pro 


Ala 


Glu Ala 


Ala 


Ala Thr 


292 






595 






600 








605 







2 95 Arg 

299 <210> SEQ ID NO: 3 

300 <211> LENGTH: 14 

301 <212> TYPE: PRT 

302 <213> ORGANISM: Gluconobacter oxydans 
305 <220> FEATURE: 

W--> 306 <221> NAME/KEY: X 

307 <222> LOCATION: (2) . . (2) 

308 <223> OTHER INFORMATION: X can be any naturally occuring amino acid. 
310 <220> FEATURE: 

W--> 311 <221> NAME /KEY : X 

312 <222> LOCATION: (4).. (4) / 

313 <223> OTHER INFORMATION: X is P or K. 
315 <400> SEQUENCE: 3 

W--> 317 Gin Xaa Asn Xaa Val Glu Val Pro Val Gly Ala Asn Glu Thr 
318 15 10 

321 <210> SEQ ID NO: 4 

322 <211> LENGTH: 31 

323 <212> TYPE: PRT 

324 <213> ORGANISM: Gluconobacter oxydans 
326 <400> SEQUENCE: 4 

328 Met Leu Pro Lys Ser Leu Lys His Lys Asn Gly Ala Met Arg Leu Val 

329 15 10 15 

332 Ala Ala Ser Thr Leu Ala Leu Met He Gly Ala Gly Ala His Ala 

333 20 25 30 

336 <210> SEQ ID NO: 5 

337 <211> LENGTH: 578 

338 <212> TYPE: PRT 

339 <213> ORGANISM: Gluconobacter oxydans 
341 <400> SEQUENCE: 5 

343 Gin Val Asn Pro Val Glu Val Pro Val Gly Ala Asn Glu Thr Phe Thr 

344 15 10 15 

347 Ser Arg Val Leu Thr Thr Gly Leu Ser Asn Pro Trp Glu He Thr Trp 

348 20 25 30 

351 Gly Pro Asp Asn Met Leu Trp Val Thr Glu Arg Ser Ser Gly Glu Val 

352 35 40 45 

355 Thr Arg Val Asp Pro Asn Thr Gly Glu Gin Gin Val Leu Leu Thr Leu 

356 50 55 60 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 



Seq#:7; N Pos . 9 
Invalid <213> Response; 

Use of °Artif icial p only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:6,7,8,9 




file://C:\CRF4\OUTHOLD\VsrJ528675.htm 



4/4/05 



VERIFICATION SUMMARY DATE: 04/04/2005 

PATENT APPLICATION: US/10/528 , 675 TIME: 10:59:40 



Input Set : A:\0185653. ST25 • txt 

Output Set: N:\CRF4\04042005\J528675.raw 



L:ll M:270 C: Current Application Number differs, Replaced Current Application No 
L:ll M:271 C: Current Filing Date differs, Replaced Current Filing Date 



L: 


306 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:3 


L: 


311 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:3 


L: 


317 


M: 


341 


W: 


(46) "n' 


1 or "Xaa" 


used, for SEQ ID#:3 


after pos . : 0 


L: 


501 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:6 


L: 


506 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:6 


L: 


511 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:6 


L: 


516 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:6 


L: 


534 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:7 


L: 


539 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:7 


L: 


549 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:7 


L: 


554 


M: 


257 


W: 


Feature 


value 


mis 


-spelled or 


invalid, 


<221> Name/Key for SEQ ID#:7 


L: 


559 


M: 


341 


W: 


(46) "n' 


! or "Xaa" 


used, for SEQ ID#:7 


after pos . : 0 
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